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Abstract. Marine bivalve mollusks synthesize, in the 
phenol and accessory glands of the foot, proteins that in¬ 
tegrate the post-translationally hydroxylated amino acid 
3,4-dihydroxyphenylalanine (DOPA) into their primary 
sequence. These polyphenolic proteins serve as structural 
and adhesive components of the byssal threads which form 
the extraorganismie holdfast. One family of byssal pre¬ 
cursors, previously characterized in a number of mytiloid 
species, consists of proteins between 70-130 kDa con¬ 
taining 8-18 mol % DOPA. The high molecular weight 
precursor isolated from the foot of the blue mussel (My¬ 
tilus edulis Linnaeus. 1758) is here designated as Mefp- 
1. We now present evidence for the occurrence, in M. 
edulis, of a second, structurally unrelated, family of DOPA 
proteins (Mefp-2) of about 42-47 kDa. These novel pro¬ 
teins contain 2-3 mol % DOPA and, in startling contrast 
to Mefp-1, are also enriched in the disulphide-containing 
amino acid cystine (6-7 mol %). Consideration of the 
amino acid compositions of Mefp-1 and 2 and of the ter¬ 
minal adhesive plaques of byssal threads suggests that 
Mefp-2 makes up about 25% of plaque protein, whereas 
Mefp-1 content is about 5%. The Mefp-2 family exhibits 
electrophoretic microheterogeneity, but members share 
similar N- and C-terminal amino acid sequences. Analysis 
of peptides isolated after tryptic hydrolysis suggests that 
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the primary sequence of Mefp-2 is tandemly repetitive, 
with at least three types of motif. The sequence degeneracy 
of the motifs is greater than in Mefp-1. Mefp-2 has min¬ 
imal sequence homology with known structural proteins 
and may be a structural element of the plaque matrix. 

Introduction 

Mussels make a secure adhesive holdfast (byssus) that 
bonds tightly to wet and irregular surfaces without the 
extraordinary preparative treatments required when hu¬ 
mans try their hand at similar tasks (Waite, 1987). The 
byssus, a collagenous extraorganismie tendon, is a bundle 
of threads that terminate in an adhesive plaque. The bys¬ 
sus is synthesized by a muscular organ, the mussel foot 
(see schematic diagram in Fig. 1). The thread coalesces 
from the secretions of various glands lining the ventral 
groove of the foot, while the plaque forms at the distal 
depression. The collagenous core of the thread is secreted 
by the collagen gland (Vitellaro-Zuccarello, 1980), and a 
protective varnish, made of an tf-diphenolic resin coupled 
with a curing enzyme, is secreted by the accessory gland 
and applied to the thread cortex (Brown, 1952: Pujol. 
1967; Vitellaro-Zuccarello, 1981). The resin is known to 
be a 130 kDa, tandcmlv repetitive protein (here called 
Mcfp-1, Mytilus edulis foot protein), which incorporates 
into its primary structure a high proportion of 3.4-dihy- 
droxyphenylalanine (DOPA), synthesized post- or co- 
translationally through the action of a tyrosyl hydroxylase 
(Waite, 1983; Benedict and Waite, 1986a). The curing 
enzyme is a eatecholoxidase, activated after secretion of 
the resin, which converts the peptidyl DOPA residues into 
peptidyl DOPA-quinone, a probable cross-linking agent 
during selerotization (Waite, 1990; Rzepecki and Waite, 
1991, for reviews). 
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The plaque is formed from glandular secretion at the 
distal depression, and V i gland involved appears 
to be the phenol < ailed because of its intense 

athnity forpher nistological reagents (Tamarin 

el ai, 1976). A s to Melp-1 bind strongly to phenol 

and accessor. as (Benedict and Waite. 1986a), and 
Mefp-1 has been isolated from dissected phenol glands 
(Waite, 1983). Mefp-1 has also been identified as the ad¬ 
hesive at the interface between plaque and substratum 
(Benedict and Waite,' 1986a), and its physicochemical 
properties are consistent with such a role (Filpula et ai, 
1990). 

Little is known, however, about the components of the 
plaque matrix. We have now characterized another po¬ 
tential ingredient of phenol gland secretion, Mefp-2, which 
may be a structural element of plaque matrix. Mefp-2 
incorporates some DOPA, but is distinct from Mefp-1 in 
composition, sequence, and physicochemical properties, 
and hence probably serves a different role in byssal struc¬ 
ture. 

Materials and Methods 

Purification of DOPA proteins 

Cultured blue mussels (Mytilus echilis L.) were obtained 
via Rehoboth Seafood Market (Rehoboth Beach, Dela¬ 
ware) after being shipped live from Maine on ice. Feet 
were excised within two days of original shipment and 
stored in 30-g lots at -70°C. 

The DOPA proteins were extracted and purified, as 
described by Pardo et ai (1990) and modified by Rzepecki 
et ai (1991). Briefly, 30 g of mussel feet were homogenized 
in a Waring blendor in 300 ml 0.7% perchloric acid, and 
the homogenate was sedimented at 31,000 X g to remove 
particulate matter. The DOPA proteins precipitated upon 
the addition of acetone to 66% final volume. The precip¬ 
itate was resolubilized in 5-7 ml 5% acetic acid and then 
fractionated, in the same buffer, on a 2.5 X 90 cm column 
of Sephadex G-200 or G-150 (Pharmacia LKB Biotech¬ 
nology Inc., Piscataway, New Jersey). The eluate was 
monitored at 280 nm. The final purification of Mefp-2 
was achieved as follows. The lyophilized Sephadex frac¬ 
tions were chromatographed in 5% acetic acid on a 2.5 
X 90 cm ci '"nm of Sephacryl S-300 or S-400 (Pharmacia) 
monitored '80 nm. This step was followed by high 
pressure liqi ^matography (HPLC) on a 0.7 X 25 
cm semi-prep. Brownlee Aquapore RP-300 (C 8 ) re¬ 
verse phase cok Rainin Instrument Co., Woburn, 
Massachusetts) wit bait HPX solvent delivery system 
(Rainin); monitoring as at 280 nm with a Model 116 
dual wavelength detecto ilson Medical Electronics Inc., 
Middleton, Wisconsin). 1 he acetonitrile/water solvent 
gradients contained 0.1% trifluoracetic acid. 


In all of the chromatography procedures, the separation 
of Mefp-1 and Mefp-2 was assessed by gel electrophoresis 
(see below). Proteins were lyophilized after HPLC puri¬ 
fication and stored dry at —20°C. DOPA concentrations 
were determined by the Arnow (1937) method (Waite 
and Tanzer, 1981), and the protein concentrations were 
determined according to Bradford (1976), with a Bio-Rad 
(Richmond, California) reagent kit. 

The localization of Mefp-2 was studied in specimens 
of AI. edulis that were collected locally (Roosevelt Inlet, 
Lewes, Delaware), kept in flowing seawater at 12-15°C, 
and allowed to deposit byssal threads on to plexiglass 
plates. In one set of experiments, some mussel feet were 
excised and serially sectioned, as illustrated in Figure la. 
The acid-urea soluble protein was extracted from each 
section by homogenization in a small glass homogenizer 
containing 200-300 p\ of 5% acetic acid, 8 M urea. The 
particulate matter was sedimented at 13,000 X g for 5 
min and was then analyzed for acid-soluble proteins by 
electrophoresis. In a second set of experiments, bvssi were 
collected 24 h after their synthesis and rinsed in water; 
the plaques and threads were then separated with a scalpel. 
About 110-140 mg and 150-175 mg wet weight of plaques 
and threads, respectively, were homogenized, with a little 
silica powder (Silica Gel G, 250 pm Analtech, Newark, 
Delaware) to provide abrasion, in either 400 p\ 5% acetic 
acid with 8 AI urea, or in 200 p\ of the sample buffer used 
for SDS-PAGE (Laemmli, 1970); the particulate matter 
was removed as above. Polyclonal anti-Mefp-2 rabbit an¬ 
tibodies were contractually prepared by Cambridge Re¬ 
search Biochemicals (Cambridge, Great Britain), and were 
used to determine the immunoreactivity of thread and 
plaque extracts and control Mefp-2 (Blake et ai, 1984). 
Dot blots on nitrocellulose were exposed for 1 h to a 1() -5 
dilution of serum, or a 10 4 dilution of pre-immune 
serum. Binding was detected with goat anti-rabbit IgG- 
coupled alkaline phosphatase (Boehringer-Mannheim, 
Indianapolis, Indiana). 

S-Alkvlation of Mefp-2 

5-Alkylation of Mefp-2 to 5-carbamidomethyl-Mefp- 
2 (CAMC-Mefp-2) or 5-carboxymethvl-Mefp-2 (COMC- 
Mefp-2) was effected after the cystine residues had been 
reduced by a modification of a procedure of Hollecker’s 
(1990). Stock solutions of Mefp-2 (10 mg/ml) were freshly 
prepared in H 2 0. To 80 ^1 Mefp-2 was added 20 p\ of 0.1 
AI EDTA, pH 7.0, which caused precipitation. After the 
addition of 400 p\ of NBA buffer (0.5 M boric acid, 0.5 
M ascorbic acid, brought to pH 8.0 with 6 N NaOH), the 
precipitate rcdissolvcd: 20 p\ of fresh 1 M dithiothreitol 
(DTT) together with 1.15 ml of 8 M urea w ? ere then added, 
and the reaction mixture was incubated for 40 min at 
room temperature. After the reduction, the cyteines were 


MUSSEL CYSTINE-RICH DOPA PROTEINS 


125 


S-alkylatcd by the addition of fresh iodoacetamide or io- 
doacetate (300 p\ of a 0.25 M stock in a 1:4 dilution of 
the NBA buffer for iodoacetamide, or, for iodoacetate, 
undiluted NBA buffer containing 0.6 of 6 A r NaOH per 
mg iodoacetate), and the reaction mixture was incubated 
for a further 30 min. The reaction was terminated by the 
addition of 20 ^1 glacial acetic acid, and the proteins were 
immediately separated from the low molecular weight 
solutes by HPLC, as described. Control alkylations of 
Mefp-2 in buffers with urea, but no DTT, were also per¬ 
formed. Protein fractions were lyophilized, redissolved in 
H 2 0, and stored at 4°C for up to 1 week before use. 

Peptide preparation and purification 

Native Mefp-2 or its alkylated derivatives were incu¬ 
bated (in 100 m M Tris, 100 m M ascorbate, pH 7.5, for 
7 h at room temperature) with trypsin (Boehringer 
Mannheim, Indiana), at a trypsin to protein weight ratio 
of 1:25 or 1:50 for native and alkylated Mefp-2, respec¬ 
tively. Mefp-2 concentrations were about 1 mg/ml. The 
incubation was terminated by the addition of glacial acetic 
acid to 5% final volume, and the peptides were resolved 
directly by HPLC on a 0.46 X 25 cm Microsorb Q 8 reverse 
phase column (Rainin), as described above. In an effort 
to recover the DOPA peptides specifically. Mefp-2 (0.5 
mg/ml) was treated with trypsin in 400 p\ of a 1:2 dilution 
of the above buffer for 6 h, then loaded on a small column 
containing 0.4 ml of phenyl boronate agarose (PBA-30, 
Lot # JM-2134D with 37.8 ^Moles boron/ml; Amicon 
Corp., Danvers, Massachusetts) which had been equili¬ 
brated in a 1:5 dilution of the above buffer. Bound peptides 
were w ashed tw ice w ith 0.4 ml of the diluted buffer, twice 
more with 0.4 ml of deionized water, and eluted with 
three washes of 5% acetic acid. Eluted peptides w^ere sep¬ 
arated by HPLC. 

Amino acid analysis and peptide sequencing 

The amino acid compositions of the hydrolyzed pro¬ 
teins and peptides were determined on a Beckman System 
6300 Autoanalyzer (Beckman Instruments, Palo Alto, 
California) using the ninhvdrin reaction for detection 
(Waite, 1991). Polypeptides were usually hydrolyzed by 
the rapid method of Tsugita et aL in 5 N HC1 with 8% 
trifluoracetic acid and 8% phenol in vacuo at 158°C for 
22 min, except that hydrolysis was effected in the bulk 
phase, rather than the vapor phase as in the original 
method. Unfortunately, the high concentrations of phenol 
required for quantitative yields of DOPA caused the re¬ 
action of phenol and cystine to give an unidentified prod¬ 
uct which co-migrated with histidine on the amino acid 
analyzer. Consequently, the amino acid compositions of 
native (non-alkylated) Mefp-2 were determined after hy¬ 
drolysis in 6 N HC1, for 24 h at 105°C. in the presence 


or absence of 10% phenol. DOPA, tyrosine, serine, and 
threonine values were recovered from phenolic hydroly¬ 
sates, while cystine and histidine values were obtained 
from aphenolic hydrolysates; we deduced a corrected 
composition from both data sets using the lysine value as 
a common factor. In the case of reduced and alkylated 
Mefp-2, alkylated cysteine was recovered after hydrolysis 
as S-carboxymcthylcysteine (S-COMC) which co-mi¬ 
grated on the amino acid analyzer with 3-hydroxyproline. 
Because native Mefp-2 contained no 3-hydroxyproline, 
the content of N-COMC was simply determined by ap¬ 
plying a correction factor of 2.23 to the reported 3-hy¬ 
droxyproline values. This correction factor was empiri¬ 
cally determined by running a concentration series of N- 
COMC standards (Sigma) on the analyzer. Peptides and 
proteins were sequenced on a microsequenator (Porton 
Instruments, Tarzana, California) using automated Ed- 
man degradation. PTH-amino acid derivatives were re¬ 
solved and quantitated by HPLC as previously described 
(Waite, 1991). Additional N-terminal analysis was per¬ 
formed on 4.6 mg/ml Mefp-2 in a buffer consisting of 46 
mA/ sodium phosphate, 18 mM sodium borate, and 18 
mA/ sodium ascorbate, pH 7.0. Incubations were per¬ 
formed at room temperature for different time intervals 
and the released amino acids were analyzed directly. The 
hog kidney aminopeptidase M (Boehringer Mannheim) 
used in this N-terminal analysis was prepared by dialysis 
against 50 mA/ sodium phosphate, pH 7.0, at a nominal 
concentration of 0.09 mg/ml. 

Gel electrophoresis and isoelectric focussing 

Acetic acid-urea polyacrylamide gel electrophoresis 
(AU-PAGE) was performed according to Panyim and 
Chalkley (1969) with 5% polyacrylamide, 5% acetic acid 
and 8 M urea, pH 2.7 (Rzepecki ex aL. 1991). Discontin¬ 
uous sodium dodecylsulphate polyacry lamide gel electro¬ 
phoresis (SDS-PAGE) was performed according to Lae- 
mmli (1970), except that various concentrations of DTT 
were used in the sample treatment buffer. Low molecular 
mass protein standards ranged from 14.4-97.4 kDa (Bio- 
Rad, California). Gels were stained for protein in 0.001% 
Coomassie Blue R-250 in 7.5% acetic acid, 40% methanol. 

The DOPA proteins were visualized by staining the 
gels with the Arnow' (1937) method, as described by Waite 
and Tanzer (1981). We also used a new' redox cycling 
method involving the production of formazan from ni- 
troblue tetrazolium (NBT) in the presence of glycine at 
pH > 10 (Paz et aL, 1991), but at a small cost in sensitivity, 
did not usually transfer the proteins eleetrophoretically 
to nitrocellulose as recommended by the authors. For Ar- 
now assays, SDS gels were first acidified by equilibration 
in 5% acetic acid; for NBT stain, all gels were washed 
twice in 50-100 ml of 0.2 A/ sodium borate, pH 8.5, to 
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Figure 1. Localization ofMefp-2 in Mytilus editlis foot and bvssus. (A) ALJ-PAGE analysis of acid-urea 
soluble proteins from a serially sectioned mussel foot. Lanes 1-5 correspond to the indicated foot sections 
and were stained with Coomassie Blue. Lanes 1-5' correspond to the same sections run on a parallel gel 
stained by the NBT redox cycling assay. The drawn parallel bars indicate position of Mefp-1 (upper) and 
Mefp-2 (lower) proteins. Schematic codes are: S, stem of the byssus; CG, collagen gland; AG. accessory' 
gland; VG. ventral groove; DD. distal depression; and PG, phenol gland. (B) SDS-PAGE (15% polyacry lamide) 
of proteins extracted from threads and plaques in SDS and neutral buffer. Gels stained as indicated; symbols 
P, T, M-2 and St refer to: plaque extract, thread extract, pure Mcfp-2, and molecular weight standards, 
respectively; arrows indicate Mefp-2. (C) AU-PAGE analysis of acetic acid-urea soluble proteins from threads 
and plaques. Gels stained as indicated; symbols are as in (B); arrows indicate Mefp-2. The higher molecular 
weight aggregates apparent in pure Mefp-2 in (B) and (C) w r ere artefactual results of prolonged storage in 
water at — 20°C and were never found in freshly isolated protein. Arnow tests w r ere not performed in (B) 
and (C). (D) Dot blots of pure DOPA proteins, and thread and plaque acetic acid-urea soluble proteins, 
stained as indicated. The amount of total protein blotted is given in the center: symbols are as in (B). 


remove the electro .is buffers before adding the NBT 
redox cycling reageir v of the NBT redox cycling assay 
was prefercd in this stu since, though it is somewhat 
less specific than the Arno in detecting o-diphcnols, 

it is considerably more sensitive, and lhe stain lasts in¬ 
definitely, once the gels have been washed first in alkaline 


borate and then in 5% acetic acid. Unless specified, how¬ 
ever. the presence of DOPA in electrophoretic protein 
species was confirmed at least once by the Arnow stain 
before routine use of the NBT assay. 

Glycoproteins were detected by the periodic acid-Schiff 
(PAS) stain (Segrest and Jackson, 1972), with bovine a r 
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acid glycoprotein (Sigma Chemical Co., St. Louis, Mis¬ 
souri) and collagen (type 11 calf skin) serving as positive 
controls. 

Denaturing isoelectric focussing (IEF) was performed 
with pH 3-10 ampholytes (FMC Bioproducts, Rockland, 
Maryland) in 8 M urea, 10% glycerol, 6% triton X-100, 
using 0.02 M acetic acid and 1 N NaOH as the anolyte 
and catholyte, respectively (Guilian ct al., 1984). Gels were 
pre-equilibrated (200 V, 15 min; 500 V, 30 min; 800 V, 
30 min) before samples (in the above buffer with 15 mg/ 
ml DTT) were applied to the gel under the anolyte. Pro¬ 
teins were focussed for 2-3 h at 1000 V, and a lane was 
removed for pH gradient determination (by incubation 
of 0.5 cm gel slices overnight in 1 ml of 0.1% NaCl) before 
the gel was stained. 


Results and Discussion 

Localization ofMefp-2 in M. edulis byssits 

DOPA proteins extracted from serially sectioned mussel 
feet and electrophoretically resolved by AU-PAGE were 
detected by the NBT redox cycling assay and with Coo- 
massie Blue (Fig. la). Mefp-1 migrated as a doublet iden¬ 
tical to the previously characterized 130 kD DOPA protein 
(Waite, 1983), and it was found in foot sections through¬ 
out the length of the ventral groove corresponding to ac¬ 
cessory and phenol glands. Mefp-2 was exclusively asso¬ 
ciated with the distal foot section that encompasses the 
phenol gland involved in plaque secretion. Several minor 
NBT-positive proteins of intermediate electrophoretic 
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Figure 2. HPLC chromatography profiles of Mefp-2 (-) and 

Mefp-1 (-) monitored at 280 nm. Proteins were resolved on a 0.7 

X 25 cm semi-preparative C 8 column with an acetonitrile gradient, as 
indicated by the inclined dashed line. Inset: AU-PAGE of purified Mefp- 
2 at three concentrations, stained with Coomassie Blue. The arrow in¬ 
dicates the position of trace Mefp-1 contaminants. 

mobility were also observed in this distal segment, but 
their origin and nature were not investigated further. To 
obtain a better separation between phenol and accessory 
glands, we micro-dissected phenol and accessory glands 
from thin foot sections and again, Mefp-2 was exclusively 
detected in the phenol gland, whereas Mefp-1 appeared 
in both glands. No other byssal glands are known to con¬ 
tain DOPA proteins. 

Extraction of byssal plaques in either acetic acid-urea 
or neutral SDS buffers yielded a polypeptide that co-mi- 
grated with Mefp-2 during both SDS- and AU-PAGE and 
stained positively with NBT (Fig. lb, c); miscellaneous 
electrophoretic species that stained with Coomassie Blue, 
but not NBT, also appeared. Preliminary amino acid 
analysis following HPLC purification of this NBT-positive 
plaque polypeptide revealed a composition very similar 
to authentic Mefp-2 (Diamond and Waite, unpub.). No 
polypeptide extracted from byssal threads alone exhibited 
NBT reactivity or electrophoretic behavior akin to Mefp- 
2, and no polypeptides corresponding to intact Mefp-1 
were detected in plaque or thread extracts. Dot blots con¬ 
firmed that a considerable proportion (ca. 10%) of acid- 
soluble plaque protein was NBT-sensitive, and immu- 
noreactivity assays demonstrated that the extracted ma¬ 
terial that reacted with anti-Mefp-2 antibodies was specific 
to plaques (Fig. Id). Prc-immune scrum was unreactive 
in these assays. These observations arc complicated by 
some cross-reactivity between the as yet unpurified poly¬ 
clonal anti-M and pure Mefp-1, but the electropho¬ 
resis and dot I ults together strongly suggest that, 
because Mefp- ved to occur in both threads and 

plaques (Benedict \ aite, 1986a), Mefp-2 is an im¬ 
portant component >f the plaque. 

Purification and cotnj n Mcf'p-2 

Considerable separation of Mefp-1 and -2 was obtained 
by chromatography of the proteins on Sephadex G-150 


or G-200 in 5% acetic acid (G-200 gave better separation 
but was not reuseable). Mefp-2, the predominant com¬ 
ponent of the trailing Sephadex fractions, was rechro¬ 
matographed on Sephacryl S-300 or S-400, because HPLC 
alone failed to separate Mefp-2 from residual Mefp-1. Al¬ 
though the difficulty of assessing the relative contributions 
of either DOPA protein to an unknown mixture preclude 
presentation of a quantitative purification table, final 
yields of pure Mefp-2 ranged between 1-5 mg from 30 g 
wet mussel feet, depending on the season of the year in 
which the mussels were collected (mussels collected in 
winter gave better yields than in other seasons). Mefp-2 
was resolved by AU-PAGE into two predominant elec¬ 
trophoretic bands (Fig. 2, inset), although minor bands 


Table 1 


Amino add composition of Mefp-2 and related proteins a 


Amino acid. 


CAMC- 



symbol 

Mefp-2 

Mefp-2 

Mefp-1 b 

Plaques 0 

3-Hyp 

0.0 

0.0 

42-27 

1.6 

4-Hyp 

0.0 

0.0 

161-120 

28.4 

Asp, D or Asn, N 

127.0 

115.1 

12 

88.3 

Thr, T 

42.2 

38.8 

113 

32.6 

Ser, S 

73.9 

65.8 

93 

66.5 

Glu, E or Gin, Q 

49.2 

43.7 

5 

50.0 

Pro, P 

111.9 

91.8 

41-82 

44.0 

Gly, Ci 

141.5 

132.6 

10 

219.9 

Ala, A 

38.6 

35.3 

81 

89.2 

Hcys 

68.2 

1.8 

0 

20.5 

Cys, C 

0.0 

155.3 

— 

— 

Val, V 

43.4 

37.3 

5 

34.0 

Met. M 

1.9 

1.0 

2 

5.9 

lie, I 

9.0 

8.2 

8 

20.3 

Leu, L 

12.6 

11.7 

trace 

44.6 

Dopa, Y* (or ¥) 

28.6 

25.8 

181-110 

14.8 

Tyr, 

56.0 

51.7 

31-73 

46.2 

Phe, F 

9.6 

9.0 

0 

10.0 

His, H 

8.0 

10.1 

3 

46.0 

Lys, K 

134.7 

124.7 

214 

51.8 

Arg. R 

43.7 

40.2 

3 

61.0 


a Amino acid compositions are given as residues per thousand (RPT) 
and are averaged from several preparations with a standard deviation of 
approximately 5%. The Mefp-2 composition was derived as described 
in Materials and Methods to obtain both DOPA and cystine values. 
CAMC-Mefp-2 was generally hydrolyzed by the rapid hydrolysis method, 
and cysteine was quantitated as S-carboxy methy lcysteine. Ranges of val¬ 
ues for certain amino acids in the Mefp-1 composition were derived 
from successive fractions under the peak of HPLC purified protein, and 
reflect differing degrees of hydroxylation of Pro and Tyr. Few such dif¬ 
ferences were apparent in Mefp-2. Cys, Lys and DOPA values are un¬ 
derlined for convenience. 
b Rzepecki cl al. (1991). 
c Benedict and Waite (1986b). 
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Elution Time (min) 

Figure 3. HPLC chromatography profiles of control Mefp-2 

(-), and CAMC-Mefp-2 (.) and COMC-Mefp-2 (.), 

reduced and alkylated as indicated in Materials and Methods, and mon¬ 
itored at 280 nm. Proteins were resolved on a 0.46 X 25 cm analytical 
C g column using the same gradient program as shown in Figure 2 (inclined 
dashed line). Concentration of control Mefp-2 was half that of the al¬ 
kylated derivatives, but the detector attenuation was halved. 


also found in Mefp-2, in addition to other significant dif¬ 
ferences. 

A comparison of the Mefp-1, Mefp-2, and byssal plaque 
amino acid compositions allows a crude estimate of the 
contributions of the two DOPA proteins to the overall 
plaque composition. Assuming that Mefp-1 is the only 
source of 3-hydroxyproIine in byssus (Waite, 1983), the 
amount of Mefp-1 in the plaque may be estimated from 
the plaque 3-Hyp value (Benedict and Waite, 1986b) to 
be 4-5 mol % of plaque protein, and the proportion of 
DOPA residues belonging to Mefp-I can be calculated. 
On the further assumption that all the remaining DOPA 
belongs to Mefp-2, a round figure of 25 mol % may be 
calculated for the plaque content of Mefp-2. At this pos¬ 
tulated concentration. Mefp-2 would account for about 
90% of plaque cystine and 70% of plaque lysine residues. 
Although these precise numbers should be taken cum 
grano satis [since (i) 3-Hyp occurs at low levels (<0.2 mol 
%) in the plaque and is difficult to quantitate; (ii) the con¬ 
tent of the readily oxidized DOPA residues is easily un¬ 
derestimated, even in fresh byssus; and (iii) minor proteins 
containing DOPA or 3-Hyp may occur], they are consis¬ 
tent with the apparent proportion of Mefp-2 extractable 
from plaque and foot (Fig. 1). These considerations thus 
establish an upper limit to the content of Mefp-2 in the 
plaque. 


of similar mobility were often apparent. All reacted pos¬ 
itively with the NBT and o-diphenol-specific Arnow stains 
(not shown), consistent with presence of DOPA in the 
amino acid composition. Mefp-2, resolved by SDS-PAGE 
(see below), did not stain with the PAS procedure for cis- 
diol sugars, though control proteins in the same gel were 
PAS-positive, suggesting that Mefp-2, in common with 
Mefp-1, is not a glycoprotein. 

The most dramatic compositional difference between 
Mefp-1 and -2 (Table 1) is the occurrence of cystine ex¬ 
clusively in Mefp-2 and at remarkably high levels (6-7 
mol %). A comparison of native Mefp-2 with reduced and 
S-carbamidomethylated protein (CAMC-Mefp-2), to¬ 
gether with the exceedingly low levels of S’-carboxyme- 
thylcysteine in the acid hydrolysates of native Mefp-2 that 
had been treated with iodoacetamide in 4.8 or 6 M urea 
without prior reduction by DTT, shows that most of the 
S-alkylated cysteine (ca. 15 mol %) had originally been 
in the disulphide form. The DOPA content of Mefp-2 (2- 
3 mol %) was much lower than that of Mefp-1 (11-18%) 
and, unlike other molluscan DOPA proteins (Rzepecki 
el at., 1991), the Lys:DOPA ratio far exceeded unity. Both 
contained proline, but no hvdroxylation to 3- and 4-Hyp 
occurred in Mefp-2. Glycine, rare in Mefp-1, made up 
about 14 mol % of Mefp-2. Elevated levels (about 5-fold) 
of Asx and Glx, and lower levels of Ser and Thr, were 


HPLC amt electrophoretic characterization of Mefp-2 

HPLC and electrophoretic analyses of native, as well 
as reduced and N-alkylated Mefp-2 revealed a challenging 
complexity in chromatographic and electrophoretic be¬ 
havior. The alkylated Mefp-2 derivatives eluted more 
slowly than control protein on HPLC, with the N-car- 
boxymethylated protein (COMC-Mefp-2) unexpectedly 
the slower of the two (Fig. 3). Mefp-2 that had been re¬ 
duced with DTT, but not alkylated, eluted with a delay 
comparable to COMC-Mefp-2 (not shown). Denaturing 
AU-PAGE profiles of fractions under the chromato¬ 
graphic peaks of Figure 3 showed that the two major bands 
exhibited by native Mefp-2 had been converted, upon re¬ 
duction and alkylation, into multiplets with four or more 
components (Fig. 4a). Both COMC- and CAMC-Mefp-2 
migrated considerably more slowly than the native poly¬ 
mer, with COMC-Mefp-2 again the slower of the pair. 
Mefp-2 that had been reduced by DTT, but not alkylated, 
migrated with intermediate mobility, but upon storage in 
water at 4°C, partially regained the faster mobility char¬ 
acteristic of native Mcfp-2 (not shown). All electrophoretic 
species stained positively for DOPA with both NBT and 
Arnow tests (not shown). 

Electrophoretic mobility on denaturing AU-PAGE 
systems depends both on molecular weight and charge 
density (Hollecker, 1990). Although alkylation of Mefp-2 
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1 iguri 4 (\) AU-PAGE analysis of sequential HPLC fractions under the chromatographic peaks of 

Figure nes 1-3, native Mefp-2; lanes 4-8, CAMC-Mefp-2; lanes 9-12, COMC-Mefp-2. The arrow 
indicates ti xpected position of Mefp-I. Gel stained with Coomassie Blue. (B) SDS-PAGE (12% poly- 
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would increase the molecular mass of the protein by 
some 3-3.5 kDa (assuming about 56-62 alkylated cys¬ 
teines per protein of 42-47 kDa with 360-400 amino ac¬ 
ids), this increase alone seems insufficient to account for 
the reduced mobility under AU-PAGE, so modified 
charge density and alterations in protein conformation 
induced by disulphide reduction and alkylation undoubt¬ 
edly contributed to the electrophoretic differences. The 
overall similarity between the relative mobilities of 
CAMC- and COMC-Mefp-2, which have widely different 
pi values (see below), together with their common differ¬ 
ence from native Mefp-2, which has the same pi as 
CAMC-Mefp-2, argue for the prc-cminencc of confor¬ 
mational changes. A transformation from a compact to 
an extended configuration upon reduction and alkylation 
is consistent with the observed anomalous electrophoretic 
and chromatographic migration. 

Analysis of Mefp-2 and its alkylated derivatives by SDS- 
PAGE, following reduction in 50 m \f DTT, was similarly 
complex (Fig. 4b). In contrast to Mefp-1, which precipi¬ 
tates in SDS, native Mefp-2 usually exhibited two elec¬ 
trophoretic bands (at low loading concentrations) with 
apparent molecular masses in the range 42-47 kDa, al¬ 
though minor NBT-positive bands migrating in the same 
range were frequently present. Electrophoretic migration 
was, however, somewhat variable within that range and 
appeared dependent on the protein concentration and de¬ 
gree of reduction. Several electrophoretic bands [which 
exhibited positive NBT and Arnow reactions (not shown)] 
were resolved for CAMC- and COMC-Mefp-2, with ap¬ 
parent molecular masses in the ranges 48-54 kDa and 
63-71 kDa, respectively. The decreased mobility of 
CAMC-Mefp-2 was almost, yet not quite, explained by 
the molecular mass increment (3-3.5 kDa, see above) re¬ 
sulting from alkylation, but COMC-Mefp-2 migrated 
considerably more slowly. S -Alkylation with iodoacetate 
is well known to induce electrophoretic anomalies (Lane, 
1978). 

Isoelectric focussing (1EF) studies in highly reducing 
and denaturing media (Fig. 4c) showed that the apparent 
pi of native Mefp-2 lay between 9 and 10 pH units (/>., 
largely beyond the resolving power of available ampho¬ 


lytes), similar to that for Mefp-1. CAMC-Mefp-2, which 
theoretically has the same net charge as native Mefp-2, 
migrated identically (not shown). COMC-Mefp-2, with a 
greatly decreased net positive charge owing to the addi¬ 
tional carboxyl moieties, migrated as a family of at least 
ten distinct members, with pi values in the range 4.7-5.9. 
Analogous electrophoretic heterogeneity occurs in the 
DOPA proteins of Fasciola hepatica (Waite and Rice- 
Ficht, 1989). 

The variable migration of native Mefp-2 during SDS- 
PAGE caused us to investigate further the effects of DTT 
on electrophoretic behavior in this system. In the absence 
of DTT, Mefp-2 could barely be induced to enter even 
the stacking gel of discontinuous SDS-PAGE systems (Fig. 
4d). As the DTT concentration was increased to 50 m M, 
progressively more Mefp-2 entered the resolving gel and 
migrated in a ladder-like pattern consistent with the pres¬ 
ence of higher molecular weight aggregates of 2, 3, 4 or 
more polypeptide chains. At the highest DTT concentra¬ 
tions, most of the Mefp-2 migrated as an apparent mono¬ 
mer (cf. Fig. 4b). The obvious explanation for this obser¬ 
vation. that the oligomeric Mefp-2 resulted from inter- 
molecular disulphide bonds, does not seem tenable, since 
significant high molecular weight aggregates were rarely 
observed upon resolution of the same protein preparations 
by denaturing AU-PAGE (although they sometimes oc¬ 
curred on prolonged storage when frozen in water— cf. 
also Figs, lb, c and 2, with Fig. 4a, d). Apparently, then, 
conditions exist for the aggregation of nativ e, but not re¬ 
duced, Mefp-2 in solution, although the precise contri¬ 
bution, if any, of SDS to this phenomenon remains to be 
determined. This is consistent with the precipitation fre¬ 
quently noted in concentrated neutral solutions of Mefp- 
2, and with preliminary X-ray scattering data (Trumbore, 
pers. comm.). 

Primary structure of Mefp-2 

Despite the apparent electrophoretic heterogeneity of 
Mefp-2, N-terminal sequencing of pure protein revealed 
a single detectable N-terminus, H 2 N-Thr-Ser-Pro-Xaa- 
Yaa-Dop-Asp-Asp-Asp-Glu . . . , where Xaa and Yaa 


acrylamide) analysis of Mefp-2 and derivatives. Lane 1 native Mefp-2: lane 3, CAMC-Mefp-2: lane 4, 
COMC-Mefp-2: lanes 2 and 5. molecular weight standards. Gel stained with Coomassie Blue. (C) 1EF 
analysis of Mefp-2. Lane 1, native Mefp-2: lane 2. COMC-Mefp-2; CAMC-Mefp-2 migrated identically to 
native Mefp-2 and is not shown. Proteins were focussed on separate gels in parallel, and the gels were stained 
with the NBT redox cycling assay. The pH gradient is indicated on the left. (D) Effects of DTT on native 
Mefp-2 migration during SDS-PAGE (12% polyacry lamide). Lanes 1 and 6 are molecular weight standards. 
DTT concentrations: lane 2, 0 m \I\ lane 3, 0.5 m M\ lane 4, 5 m l/: lane 5. 50 m M. The gel was stained 
with Coomassie Blue. Lanes 3'-5 f are from a gel run in parallel, but stained with the NBT redox cycling 
assay, and correspond to lanes 3-5 in the gel on the left. 
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were predominant!) P 1 \rg, though Tyr, Gly, Thr 
and Gin were also 1 [ able 11). Treatment of Mefp- 

2 with aminopet * eleased low levels of Thr, Ser, 

Pro, DOPA consistent with the sequenator 

analysis. iCal combination of variability and 

constan -terminus might be explicable in terms 

of multiple genes or alternate mRNA splicing mechanisms 
(cj. Bobek et ai , 1988; Pihlajaniemi and Tamminen, 
1990), but analysis of Mefp-2 from an individual organism 
has not yet been possible. 

Native Mefp-2 (unlike Mefp-1) was relatively resistant 
to a variety of proteases, including trypsin, chymotrypsin, 
pepsin. Staphylococcus aureus V8, and collagenase, even 
at unusually high protease:Mefp-2 ratios (up to 1:1) in 
the presence of urea, although increases in electrophoretic 
mobility and heterogeneity were often apparent upon 
electrophoresis (see Fig. 5a, inset, for AU-PAGE of tryp¬ 
sin-treated Mefp-2). SDS-PAGE in the presence of DTT 
showed that, upon trypsin treatment, the apparent mo¬ 
lecular mass decreased from about 45 kDa to 30-35 kDa 
with some increase in heterogeneity (not shown). HPLC 
of trypsin-treated native Mefp-2 resolved some minor 
peptides (Fig. 5a), but reduction and alkylation of the 
residual disulphide bonded protein did not result in the 
recovery of any new peptides upon rechromatography, 
confirming that little internal nicking had occurred. 
Trypsin digestion of both COMC- and CAMC-Mefp-2 
converted both alkylated derivatives to their component 
peptides (Fig. 5). These results suggest that trypsin 
trimmed native Mefp-2 at the N- or C-terminus, or both. 


leaving the residual disulphide bonded core protein largely 
intact. 

Peptides obtained by HPLC chromatography of trypsin- 
treated native and alkylated Mefp-2 fell generally into 
three major classes, all containing either tyrosine or DOPA 
(Figs. 6, 7). All three types were isolated from alkylated 
Mefp-2, but only Type II peptides were identified in digests 
of native Mefp-2. Owing to the chromatographic com¬ 
plexity of the peptide mixture, the relative content of the 
three classes was rather difficult to determine with any 
precision. Most fractions contained two major peptides 
(usually in unequal proportions), and sequences for pep¬ 
tides in various HPLC fractions were assigned by consid¬ 
eration of relative yields of amino acids in successive se¬ 
quenator cycles. Fortunately, sequence consistencies be¬ 
came readily apparent, and a consideration of Figures 5- 
7, assuming that UV absorbance at 230 nm broadly re¬ 
flects peptide concentration, suggests that the majority (at 
least 80%) of Mefp-2 can probably be accounted for by 
the identified motifs. 

Type I peptides. This type of motif appeared to pre¬ 
dominate in Mefp-2, with identified peptides accounting 
for perhaps 40-50% of the 230 nm absorbance of Figure 
5b, although no individual peptide contributed more than 
5-10%. It was highly enriched in Cvs, Gly, and Pro, and 
was basic due to the excess of Lys and Arg over GIu and 
Asp. Significantly, we found no evidence that the Tyr res¬ 
idues were hydroxylated to DOPA in this type of motif. 
For convenience, we have subdivided the Type I motif 
into six submotifs (Fig. 6), which combine variously to 


Table II 


N-terminal analysis of Mefp-2* 


Sequenalor analysis 



Amino acid (yield, pMoles X 10 3 ) 

Aminopeptidase analysis 

Cycle # 

Fraction 5 

Fraction 6 

Amino acid (RPT) 

1 

T (2.0) 

T (1.5) 

S(225, 219) 

2 

S (1.8) 

S (1.4) 

T (185, 178) 

3 

P (2.2) 

P (1.8) 

P (171, 187) 

4 

R (2.1) 

P (1.1) R (0.6) Y (0.5) G (0.5) 

R (70, 88) 

5 

P (1.0) T (0.5) V (0.5) Q (0.4) 

P (1.2) Q (0.9) 

Y* (64, 48) 

6 

Y* (1.7) 

Y* (E4) 

D/N (32, 25) 

7 

D(1.5) 

0(1-6) 

[G (93, 87)] 

8 

0(1.6) 

0(2.1) 


9 

O (2.3) G (1.2) 

D (2.1) 


10 

veral amino acids detected in 

significant quantities (D, E, G, K , Y) 



a N-Terminal sequences an opeplidase M analyses were obtained as described in Materials and Methods. Sequences were derived from two 
adjacent fractions under the p< n HPL C elution profile of Mefp-2, similar to that of Figure 2. In the aminopeptidase analysis, amino acid 

compositions from two individu riments (with aminopeptidase incubation times of 2.5 and 16 h). are reported as residues per thousand (RPT) 

for the most abundant amino remainder ranging between 0-30 RPT. The yields were low, perhaps due to protein aggregation, adsorption 

in neutral buffer, or both; thus the gl\c e detected probably reflected background contamination of the vessels or protein preparations. One letter 
amino acid codes are given in Table 1. 
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Figure 5. HPLC chromatography profiles of Mefp-2 and derivatives 
following proteolytic digestion with trypsin. Peptides were resolved on 
a 0.46 X 25 cm C 18 column using acetonitrile gradients as indicated by 
the dashed lines. Fractions of 1 ml were collected at a flow rate of 1 ml/ 
min. (A) Peptides and undigested core protein (fractions 68-72) derived 
from native Mefp-2. Inset: AU-PAGE analysis of native Mefp-2, CAMC- 
Mefp-2 and COMC-Mefp-2 before (lanes 1-3, respectively), and after 
(lanes 4-6, respectively) proteolysis with a trypsin:protein weight ratio 
of 1:50, or 1:25 in the case of native Mefp-2. Equivalent concentrations 
of protein were run in each lane, and the gel was stained with Coomassie 
Blue. (B) Peptides derived from COMC-Mefp-2. The peptide map derived 
from CAMC-Mefp-2 was qualitatively similar to that shown in (B). 


Consensus (Type I): 
Peptide # 

COMC CAMC _ 


38 

50b 

64b 

57a 

48a 

49a 

52a 

62a 

63a 

63b 

64a 

66a 

69a 

68 


71 

58 

55b 

31 

34 

53 

49b 

55a 


32b 


53b 

46b 


51b 

52b 

51a 

53a 


54b 

54a 

52b 

27/28 

32a 

48 


Cystine Enriched Tryptic Peptides 

Oligopeptide Submotifs 
1 2 3 4 5 6 

[£Y] QJ GGY3G PTC QENAC KPNPQ [X]K 
T L GJ V A 

_Seq uence _ 

Type I peptides 


Q APLGK 
VNVC KPN... 
XICAR1 PCK 
YVQAPNPQK 
C GENMC KPNPC QNK 
V A 

C QVHAC KPNPC N{N/T}K 
C GVN?£ NPNFC K 
PTQ GENVQ APNPQ ££K 
PJQ GVNAQ KPNFQ QK 
F 

PTQ NVNAQ KPNP- IK 
PTQ QENAQ KPNFQ K 
PIC QENAC KPNPC SN... 
PIC -HD— KPNPC N... 

PTC -ED— KPNPC NIK 
NV 

FQ- - • 

LT... 

PIC QENAC KPNPC N... 


GGYSG 

GGY33 

QS GGYSG 
QV GGYSS 
CTCV GGY33 
CL GGYT3 
CL GGXEG 
V 

CYC V GGY33 
CYCT GGYCG 
QYFD NSDDG 
QYW K 
QYFD CSC 

GGYSG 

[S]GYYG 

Q 

gtgyvgyig 

GSSYN- 


P( K) 
PEQ . 


--C ic K 


73a 

69b 


Others 

(CSADK] FCDYICDFRPGYFGPE {Q/Q}. 
GGYGKFNDYLE... 


form both complete and truncated motifs. Tyrosine typ¬ 
ically occurs in submotif #2, GGYSG, in the company 
of small residues. This propinquity of Tyr and small res¬ 
idues is common in tyrosine- or DOPA-containing struc¬ 
tural proteins from species in various phyla (e.g., Johnson 
el ai, 1987; Waite and Rice-Ficht, 1987, 1989; Bobek et 
ai, 1988; Mehrel el ai , 1990; Aggeli el ai, 1991; Rzcpecki 
el ai, 1991); it has been attributed in part to the hypo¬ 
thetical steric requirements of 3-tyrosyl hydroxylases 
(Rzepecki et ai, 1992), although this feature is clearly 
insufficient for tyrosine hydroxylation in Mefp-2. Tyrosine 
also occurred in submotif #1 with vicinal Cys residues; 
this relationship has never been observed in a protein 
where tyrosine is known to be hydroxylated to DOPA, 
but occurs infrequently in other glycine, cystine, and ty¬ 
rosine enriched structural proteins, although replacement 
of Tyr by Cys in (G)„Y/C sequences is more usual (e.g., 
Bobek el ai, 1988; Mehrel et ai, 1990; Aggeli et ai, 1991). 


Figure 6. Cystine-enriched tryptic peptide sequences derived from 
alkylated Mefp-2. Numbers under the COMC heading correspond to 
fractions obtained from the HPLC chromatogram depicted in Figure 5b 
(i.e., peptides from COMC-Mefp-2): those under CAMC correspond to 
HPLC fractions of peptides from CAMC-Mefp-2 (not shown). Letters 
after the numbers indicate whether the peptide was a major (a) or minor 
(b) component of the HPLC fraction. 1 n cases of ambiguity, w here more 
than one peptide was found in the fraction, sequences were assigned by 
consideration of relative PTC-amino acid yields in successive sequenator 
cycles. Sequences in italics are those obtained from both COMC- and 
CAMC-Mefp-2 preparations. DOPA and alkylated cysteine residues are 
emphasized, and sequences are spaced as submotifs for convenience only. 
Residues in parentheses () were inferred from composition data, or were 
difficult to assign owing to low yields or ambiguous results in a sequenator 
cycle. Residues in brackets [ ] are optional sequence extensions in peptide 
variants. Residues separated in braces {/} are alternative amino acids 
found in that position of the sequence: in other cases, alternative amino 
acids are shown below the main peptide sequence. Ellipses (...) indicate 
probable continuation of a peptide beyond the last residue clearly resolved 
on the sequenator; dashes (-) indicate gaps introduced into the se¬ 

quence to optimize putative alignment. The symbol X in the consensus 
sequence indicates one or two optional small polar amino acids. 
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Submotifs #3 (PTC) a A ' (KPNPC) are enriched in Cys 
and Pro and are, i the GG\ SG submotil. 

the most conser Submotif #4 is relatively 

variable, with s of Gly for Gin, Val for Glu, 

and Ala for \ i the terminal Cys residue appears 

invariant. ^initial submotif (#6) consists of 0-2 

small polar residues (Asn, Gin, etc.) with a C-terminal 
Lys or Arg. A search of the SW1SS-PROT Protein Se¬ 
quence Data Bank [release 19 (August 1991) with 21,795 
entries] failed to identify any remarkable homologies be¬ 
tween the Type 1 motif and any other known sequence. 
[Note, however, the conceptual similarity between Type 
I motifs and conotoxin sequences. Conotoxins, isolated 
from the venomous marine cone snails (Conus), are small 
(10-30 amino acid) multiply disulphide bonded peptides, 
with 2-4 highly variable loops of 1-6 amino acids between 
adjacent Cys residues, and are potent antagonists of var¬ 
ious neuromuscular receptors (Olivera et al ., 1990)]. The 
Cys-rich nature of Type 1 motifs, as well as their relative 
abundance, suggests that they are derived from the core 
of Mefp-2, which is resistant to trypsin digestion in the 
non-alkylated state. If they are thus concentrated, then 
such an arrangement would result in a quasi-periodic 
distribution of Cys and Tyr residues over much of the 
protein. 

Type II peptides. These motifs were enriched in Pro 
and Ser, and were highly acidic, containing almost pal¬ 
indromic runs of Asp and Glu bracketed by DOPA 
residues (Fig. 7). The first DOPA was often found at 
the C-terminus of a PPSY* (Y* = DOPA) sequence, an¬ 
alogous to a submotif of the Mefp-1 tandem re¬ 
peat (AKP*SY*P*P*TY*K where P* = hvdroxyproline; 
Waite, 1983). The second DOPA had Asp as its N-ter- 
minal partner with, frequently, Thr at its C-terminus; 
similar arrangements occur in the high molecular weight 
DOPA proteins from other molluscan species, but are 
foreign to Mefp-1 (Rzepecki et al., 1991). No Cys or Gly 
residues were found, thus sealing the distinction between 
Type 1 and Type 11 sequences. The Mefp-2 N-terminus, 
TSPxxY*DDD(E) . . . (Table II and Fig. 7), is a Type II 
sequence, and was the only N-terminus detected (see 
above). The C-terminal sequence (K)SPPSY*NDDDEY* 
was identified in several independent peptide preparations 
by the close espondence between the observed amino 
acid compos which lacked Lys or Arg (in trypsin 
derived prepai but had equimolar DOPA, Ser, and 

Pro contents, expected composition calculated 

from the sequent ultimate DOPA residue was in¬ 
ferred largely fron ition data, since sequenator 

yields of C-terminal "re greatly reduced, though 

still marginally evident cases. No other peptides 

were reliably identified terminal. Thus the DOPA 
rich N- and C-terminal sequences of Mefp-2 are probably 
structurally similar and, in contrast to the basic Type I 


Dopa Enriched Tryptic and Chymotryptic Peptides 

Peptide # 


COMC CAMC 

Native 

Seauences 

Comments 


Type II peptides 



^Untreated 

TSPxxYDDD (E)... 

N-Terminus 

66b 


TSPDPYDDDEDDyrPPV (R) 


62b 


T-YDDDEDDYIPPV (R) K 



^Trypsin 

TSP-DDEDD?TPPV (K) 

Dopa prominent in 



D P DE 

composition but not 



Q 

detected in sequence. 


16 

TSP(R) ..? 



tChyraotrypsin 

KSPPSYNDDDE (Y) 

C-Terminus? 


^Trypsin 

SPPSYNDDDE QT) 

C-Terminus? 

45a 

43/44 

SPPSYNDDDE (Y) 

C-Terminus? 


tChyraotrypsin KSPPSNNDD... 




Type III peptides 


52b 


AY —NRQfKR 


47 a 


AXKPNPCASR[PGYR] 


48a* 


AYKPNPCASG-PCPK 




IN... 


49* 


AYKPNPCASS-PCtK) 


58* 


AYKPNPgWSKPC (£)... 




Others 


70* 


YFPfiCPSYN3.,. 


48b 


NT£I£GYNGSGGYR 


50a 46a 


YNG^KP {N/S} GGSYK 

47b 


YNGV... 




Y 


48b* 


SNGVCK(F) SG... 




Y 


19 

^Trypsin 

GYSCR 




Y K 


62* 


Y?PVNP£LKK 



70 

XSPVN... Core Protein N-Terminus?? 



Y 


68* 


GGYYGYNQ^.. 



Figure 7. DOPA-enriched tryptic peptides from native Mefp-2 and 
alkylated derivatives. Numbers under the Native heading correspond to 
fractions from Figure 5a, those under COMC and CAMC headings are 
as in Figure 6. Numbers marked with an asterisk (*) correspond to pep¬ 
tides isolated by phenyl boronate affinity chromatography prior to HPLC 
purification (not shown). The symbol (t) indicates data from earlier pre¬ 
liminary experiments using Mefp-2 prepared from different batches of 
mussels. “Untreated” Mefp-2 refers to the N-terminal analysis. Peptides 
from native Mefp-2 specifically labeled as “trypsin” derived were obtained 
by digestion of Mefp-2 in 100 mAf Tris, 50 m M ascorbate buffer, pH 
8.0 for 24 h at room temperature with a trypsin:protein weight ratio of 
1:10, then isolated by HPLC on a 0.46 X 25 cm C 8 column with the 
same elution gradient as used for protein isolation. Peptides were detected 
at 280 nm. Peptides specifically labeled as “chymolrypsin” derived were 
obtained by digestion of Mefp-2 in 150 m Al Tris, 150 m Al ascorbate 
buffer, pH 7.5, al room temperature for 6 h with a chymotrypsin:protein 
ratio of 1:1, then resolved by HPLC, following lyophilization and re¬ 
suspension in 5% acetic acid, on a 0.46 X 25 cm Vydac Ci 8 column with 
a 0-25% gradient of acetonitrile in water, incorporating 0.1% trifluoracelic 
acid. Because few peptides were obtained by either of these protocols, 
the HPLC profiles have been omitted. Other symbols and protocols are 
as described in Figure 6, except V which is a symbol for DOPA to facilitate 
peptide alignment. 
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motifs and the DOPA-containing motifs from other 
DOPA proteins, are highly acidic. 

Type III peptides. This last major type contained both 
DOPA and Cys residues, in addition to Pro, Lys, and Ala. 
The DOPA residue was found in the sequence 
AY*KPNPC . . . , where the KPNPC segment is iden¬ 
tical to submotif #5 of the Type 1 motif. Although there 
is some compositional similarity to the Mefp-1 consensus 
motif, the only sequence identity is in the dipcptide Y*K, 
which is a common feature of many molluscan DOPA 
proteins (Rzepecki et al ., 1991). The distribution of 
Type III peptides within the Mefp-2 sequence is 
unknown. 

Minor peptide sequences were apparent, including 
some unusual vicinal YY and Y*Y dipeptides previously 
found in DOPA proteins from the mussels Modiolus mo¬ 
diolus and Trichomya hirsutci (Rzepecki et al., 1991), and 
certain tunicates (C. Hawkins, pers. comm.). Although 
variation in Mefp-2 is considerable, the domination of 
the heterogeneous Mefp-2 family by a limited number of 
motif variants indicates that it is tandemly repetitive. 
However, unlike most DOPA proteins, the motif degen¬ 
eracy is quite marked, and the sequences often diverge 
markedly from those expected on the basis of previously 
analyzed DOPA proteins (Rzepecki et al, 1991; Rzepecki 
and Waite, 1991). Moreover, Mefp-2 exhibits no sequence 
homologies with any previously characterized cysteine- 
or cystine-containing proteins, including disulphide 
bonded collagens (Dublet and Rest, 1991), high sulphur 
keratins (Fraser et al., 1972), zinc-binding proteins 
(DuBois et al ., 1990: Opipari et al., 1991), metallothi- 
oneins (Lerch, 1980: Nemer et al., 1985), keratinocyte 
loricrins (Mehrel et al., 1990), proteoglycans such as per- 
lecan (Noonan et al. , 1991), and adhesive laminins of the 
neuromuscular junction (Hunter et al, 1989). Mefp-2 thus 
appears to be a novel class of structural protein, and any 
short sequence similarities to other classes may be sto¬ 
chastic or result from common structural features related 
to function. 

Conclusions 

The blue mussel, Mytihis edulis, incorporates at least 
two distinct DOPA proteins into its byssus. The first of 
these to be characterized (Waite, 1983), Mefp-1, forms a 
protective varnish at the interface between byssus and 
environment, and may also be the adhesive agent at the 
interface between plaque and substratum (Benedict and 
Waite, 1986a). The second major DOPA protein of the 
blue mussel, Mefp-2, appears to be a structural component 
exclusively of the plaque, contributing up to 25% of plaque 
protein. (About 90-95% of the dry weight of the M. edulis 
plaque is proteinaceous; Diamond and Waite, unpub¬ 
lished.) Mefp-2 seems to be a tandemly repetitive, mul¬ 


tidomain protein, with short, acidic, DOPA-containing 
N- and C-terminal regions, and a large central domain 
constrained by quasi-periodic internal disulphide bridges 
to a compact conformation resistant to proteolytic deg¬ 
radation. The peptide motifs of Mefp-2 are quite unlike 
those of any other known structural proteins. Speculations 
about secondary structure are premature, because in tan¬ 
demly repetitive proteins, the correct order of short peptide 
motifs over an entire protein cannot be deduced by stan¬ 
dard peptide mapping techniques, and it is debatable 
whether all members of the heterogeneous Mefp-2 family 
share identical subsets of motifs. Nevertheless, the se¬ 
quence of the Type 1 motif (Fig. 6) of Mefp-2 contains 
proline and/or glycine rich segments that alternate with 
sequences containing none of these /Tturn associated res¬ 
idues, and it is therefore consistent with conformational 
models, such as the /^-meander postulated for certain gly¬ 
cine and tyrosine rich Schistosoma eggshell proteins 
(Rodrigues et al. 1989). 

The composition of Mefp-2, incorporating cystine and 
DOPA, implies some role involving the stabilization of 
the plaque matrix by covalent disulphide and quinone- 
derived cross-links. In vitro, Mefp-2 can form oligomeric 
aggregates that might, in time, be stabilized by rearrange¬ 
ment of disulphide bonds to form inter-molecular cross¬ 
links. If Mefp-2 indeed constitutes 25% of plaque protein, 
then it would incorporate 90% of the plaque cystine res¬ 
idues and thus would be virtually its own sole potential 
disulphide cross-link partner. The DOPA residues might 
then serve to cross-link the resulting disulphide-linked 
Mefp-2 homopolvmer to other protein components of 
the plaque, although the apparent extractability of Mefp- 
2 from day-old plaques suggests that, if such cross-links 
are real, they form slowly. Much of the internal volume 
of the blue mussel plaque is occupied by a microscopic 
foam which does not bind anti-Mefp-1 polyclonal anti¬ 
bodies strongly (Benedict and Waite, 1986a). Disulphide- 
rich proteins (Kitabatake and Doi, 1987; Okumura et al, 
1989), and rigid proteins in general (Graham and Phillips, 
1976; Hailing, 1981), arc known to stabilize foams, and 
we had previously noted that Mefp-1 should be a good 
foam stabilizer by virtue of its cross-linking potential 
(Rzepecki and Waite, 1991). Mefp-2 also appears suited 
for such a role, although until it can be precisely localized 
within the plaque matrix, its function will remain con¬ 
jectural. [Preliminary attempts to extract acid-soluble 
proteins analogous to Mefp-2 from the foot of the ribbed 
mussel Geukensia demissa, which secretes a high molec¬ 
ular weight DOPA protein (Gdfp) analogous to Mefp-1, 
but lacks a distinctive foam structure in its plaque (Waite 
et al, 1989), have failed to detect significant quantities of 
any putative Gdfp-2 (Rzepecki, unpub. data): a correlation 
of habitat and plaque ultrastructure with DOPA protein 
variants in other mussel species should be informative.] 
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Many questions, h?n* ..main unresolved. The ap¬ 
parent localization to the phenol gland is 

somewhat equiv< phenol gland is an apparent 

site of Mefp-’ storage (Waite, 1983; Benedict 

and Waite true nature of the phenol gland 

secretions is < ^azzling—do they contain one or both 

DOPA protein. The sequence differences between 
DOPA-containing motifs in Mefp-1 and Mefp-2 raise 
questions about the specificity and number of the putative 
tyrosyl hydroxylases responsible for conversion of peptidyl 
tyrosine to DOPA, and also about the nature of the cate- 
choloxidase responsible for peptidyl quinone production 
(although curiously, no catecholoxidase activity has yet 
been detected in the phenol gland). The reproducibility 
of specific disulphide bonds, and thus the degree of con¬ 
servation of secondary and tertiary structure, is unknown. 
Finally, although we still require a full representative se¬ 
quence from the Mefp-2 family, data obtained from 
cDNA or genomic sequences must be interpreted with 
caution, because the Mefp-2 family of proteins may prove 
to be a multigene family and may also be subject to al¬ 
ternative splicing of mRNA. The association of such phe¬ 
nomena with many repetitive structural proteins is well 
documented (e.g., Bobek et ai, 1988: Pihlajaniemi and 
Tamminen, 1990). 
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